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OO ! Abstract 

<N ■ 

A method is presented that reduces the number of terms of systems of linear equa- 
tions (algebraic, ordinary and partial differential equations). As a byproduct these 
systems have a tendency to become partially decoupled and are more likely to be fac- 
torizable or integrable. A variation of this method is applicable to non-linear systems. 
q \ Modifications to improve efficiency are given and examples are shown. This procedure 

can be used in connection with the computation of the radical of a differential ideal 
(differential Grobner basis). 

> 

: 1 Motivation 

^ ' Algorithms for applying integrability conditions to a system of differential equations in a sys- 
tematic way in order to generate simplified differential equations are implemented in a number 
of programs ( IH El EEH EEU U2j and more in |Hj). Such calculations result in the radical 
or a (pseudo) differential Grobner Basis of the differential ideal generated by the original 
system. A common problem of these algorithms, and consequently their implementations, is 
an explosive expression swell. Optimizations like Buchberger's 2 n< ^ criterion (section 5.5 in 
PP) and their analogue for differential equations aim to reduce the number of steps to reach 
a characteristic set (one step = computation of an S-polynomial for algebraic systems or a 
cross-differentiation of two differential equations for differential systems). These optimiza- 
tions do not cover other 'obvious' simplifications. For a very simple example, consider Df 
to be a leading derivative of a function / in two equations = Df + A, = Df + 2A with 
A a sum of a large number of terms. A simplification step in the standard procedure would 
aim at eliminating Df and get as a consequence the system = Df + A, = A where the 
big expression A occurs twice. An alternative to be described in this paper would be to try 
to shorten equations and therefore to get at first the system = Df + A, = Df and in a 
second length reduction step the system = A, = Df. 

For a slightly more realistic example, consider two equations = A + C and = B — C, 
where A, B, C are differential expressions, A, B having only few terms and C involving many 
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terms. If both equations are long and if they involve high derivatives then they would have 
a low priority to be used in standard algorithms. Typically, each of them would be paired 
individually with short low order equations for their reduction or the generation of integrability 
conditions. A simplification of both equations to the system = A + C and = A + B would 
usually not be found. Instead the expression C would grow in both equations when any 
substitutions of functions are made that occur in C. 

Both examples are clear cut situations where big expressions in different equations can 
cancel each other. Although in practical applications to be described in a later section each 
length reduction step saves only a small number of terms, these reduction steps can often be 
repeated many times. 

The need to reduce the length of equations comes from the danger of dealing with long 
equations in elimination algorithms. For elimination algorithms to be finite, i.e. to involve 
only a finite number of steps, they have to eliminate the leading derivative of equations first 
which involves differentiations and multiplications. Both tend to increase the length of long 
equations even further. However, elimination algorithms would remain finite if terms other 
than the leading derivative would be eliminated a finite number of times. Flexibility of this 
kind could be used to prevent excessive expression swell. In this way not only memory is 
saved. Long expressions also require an increased time to be computed which slows down any 
future computations in which long expressions are involved. 

The procedure to be described in section |21 is a first step in the direction of an 'intelligent' 
and more efficient computation. It aims at finding equations in the algebraic ideal of the 
given system with fewer terms. The basic idea is rather straightforward. Any pair of two 
equations of a given system of equations is checked whether there is a linear combination 
(with non- vanishing coefficients) of these two equations that is shorter than the longer of the 
two. If that is the case then the longer one is replaced by the shorter new equation. To find 
a length reducing linear combination of a pair of two equations, each term of one equation is 
divided by each term of the other equation, the quotient is simplified, i.e. common factors of 
numerator and denominator are dropped and a counter of the number of occurrences of this 
quotient is incremented. The quotient occurring most often is picked, its numerator and its 
denominator are the multipliers of both equations which are then subtracted from each other. 
By choosing the quotient that occurs most frequently a maximum number of terms cancel 
and the result is as short as possible. 

The main content of the first part of the paper is to introduce data structures (L, Cj below) 
which allow an efficient implementation and to describe some optimizations that speed up the 
method and that allow one to consider non-linear equations for length reduction. Beneficial 
side-effects are discussed such as the increased chance to find ordinary differential equations 
(ODEs) or to find exact differential equations in a length reduced system of partial differential 
equations (PDEs). 

In the second half, in section |3] the method is applied to determining equations of Killing 
vectors and Killing tensors in General Relativity. One example is explained in more detail in 
the appendix where the beneficial side-effects of length reduction become important. 

The length reduction module is incorporated in the package Crack for solving over de- 
termined PDE-systems. To show that the usefulness of the length reduction module is not 
just based on special features of Crack but is of a more universal nature a test is described 
at the end of section |21 In this test other well known programs are used to solve a system of 
PDEs before and after it has been length reduced. The suitability of length reduction as a 
pre-processing step is demonstrated. 
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2 The term reduction method 



Introduction 

The procedure to be described takes as input two expressions E X ,E 2 . Both represent equations 
= Ei — E 2 and are regarded as sums with n\ and n 2 terms (rii,n 2 > 1). The aim is to 
multiply each with a different single term and to add the expressions such that the result has 
fewer terms than max(ni,ri2) and can be substituted for the longer of Ei,E 2 . For a given 
system of equations this process is repeated with all possible pairs of equations until no pair 
can produce an equation that is shorter than the longer of both. The restrictions to multiply 
only with monomials and to combine only two equations at a time are non-trivial constraints 
(see the discussion in section HJ) . 

Example: The method will be illustrated with the following two expressions 

4 

E l = 2xf + 6yf + Axg + 5x = Y / Eu (1) 

4 

E 2 = 3yf-3xf + 6yg-7y = J2 E * ( 2 ) 

i=l 

f and g are the unknowns that are to be computed from = E\,0 = E 2 . To explain the 
method it does not matter if / and g would be replaced by derivatives of unknown functions 
or by products of powers of different derivatives as long as / ^ g. x and y are independent 
variables or parameters such that = Ex, = E 2 are to be satisfied for any value of x and y. 

The treatment of two equations 

Given are two expressions E\,E 2 with n\ and n 2 terms, n\ > n 2 . If each expression is 
multiplied with a single term (monomial) and both expressions are added, then their sum 
E% can have between ni — n 2 and n\ + n 2 terms depending on how many terms cancel each 
other. A way to find the optimal cancellation, i.e. optimal multipliers is to divide each term 
of expression E\ by each term of E 2 and to collect the simplified quotients (common factors 
of numerator and denominator dropped) together with the multiplicity they occur. 

Example: E\, E 2 given in CQ),© have 4 terms each = n 2 = 4) and the quotients are 



{Eii/E 2j ) = 
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7y 
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4xg 




2x 


4xg 


3yf 


3/ 


3y 


7y 


5x 


5 


5x 


5x 


3yf 


3/ 


6yg 


7y 



A new equation E 3 would be generated by picking a quotient, say E\ 2 jE 2 \ = and using 
its numerator and denominator to compute E3 = 3yf ■ E\ — 4xg ■ E 2 . Because this quotient 
involves / and g, the new equation = E 3 is non-linear in f,g. As a consequence after 
replacing E\ by E 3 the new system = E 3 = E 2 may not be equivalent to the old system 
= E\ — E 2 , i.e. E\ could not automatically be replaced by E 3 . The algorithm will therefore 
not consider quotients that involve any unknowns, here / and g. An effective method to avoid 
the computation of such quotients will be explained further below. 

The quotient occuring most often is || which is appearing twice and is free of / and g. 
Each appearance of a quotient means that two terms cancel. We therefore would expect that 
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in the expression E 3 := 3y ■ E\ — 2x ■ E 2 two times two terms cancel and therefore E 3 has 
4 + 4 — 2- 2 = 4 terms. But E 3 = 6fx 2 + 18fy 2 + 29xy has only 3 terms. When computing 
E 3 := 3y ■ Ei — 2x ■ E 2 the two terms 3y ■ Em — 2x ■ E 2i = 3y ■ 5x — 2x ■ (—7y) = 15xy + lixy 
add up to only one term 29xy. We could have forecast the saving of one additional term by 
realizing that the quotient E14/E24 = — 1| differs from EujE^x = §| only by a numerical 
factor. 

The example implies that we should record all quotients with the multiplicity they occur 
and group them into classes Cj where all quotients in a class differ by only a numerical factor. 
Finally, for each class q the sum Mj of all the multiplicities of all quotients in the class is 
recorded too. All classes Cj together with Mj are listed in a list L: 

L = ((d,Mi) J (c 2 ,M 2 ),...,(c r ,M r )) 

Ci = ((qu,mn), (qi2,m i2 ), . . . , (q iSz ,m iSi )). 

qij are the different quotients such that two quotients gy, q^ in the same class c, differ only 
by a numerical factor. 

S{ is the number of different quotients in the class q. 

rriij is the number of how often q^ occurs. 

Mi are defined as Mj = YljLi m ij- 

L is the complete (unsorted) list of all classes of quotients. 

Disregarding quotients involving / or g in the above example we have 

L=((((|,2),(-!,l)),3), 

(((-§,1), (2,1)), 2), 
(((-?,!)),!) 

with, for example, qn = || turning up twice, once as E n /E 2 i and once as E 13 /E 23 , therefore 
mn = 2, and further c x = ((g, 2), (-||, 1)), M x = m u + m 12 = 2 + 1 = 3. 
If a quotient is used to combine E\ , E 2 to 

£■3 = denominator(gj :) ') x E\ — numerator (q^) x E 2 

then the number n 3 of terms of £3 is 

n 3 = ni + n 2 — 2 x mjj (due to complete cancellations of 2 terms ) 

— (due to savings of one term each time) (3) 

= m + n 2 — rriij - Mi. 

The m ik in eqn. (J3J) comes from simplifications like 15xy + lAxy = 29xy which each save 
one term. In order to be successful and to replace E\ by E 3 we need to find a quotient q^ 
such that E 3 has fewer terms than Ex, i.e. n 3 — ni + n 2 — rriij — M < n i, hence rriij + Mi > n 2- 
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A pre-processing step 

As argued above only quotients qij should be considered which do not involve any unknowns, 
i.e. functions or constants that are to be computed from Ej = 0. An effective method to 
even avoid the computation of those quotients requires to re- write expressions Ei,E 2 in the 
following way. This initial re-writing step also clarifies how the method works for non-linear 
expressions Ei,E 2 . 

One always can regard non-linear expressions Ei,E 2 as linear homogeneous expressions 
in some newly defined variables v a which are linear or non-linear constructs of the dependent 
variables. For example, for independent variables x, y, dependent variables / = const, g = 
g(x,y) and 

E 1 = 3x + 3 cos(x) fg x - I2xyg + 6xg - yg + sm(g y ), (4) 
E 2 = l + Afg x -Ayg + 2g (5) 

the related system that is homogeneous and linear in Vi would be 

Ei = 3xv + 3 cos(x)v i — 12xyv 2 + 6xv 2 — yv 2 + v 3 , (6) 
E 2 = v + Avi - Ayv 2 + 2v 2 (7) 

with vq = 1, v\ — fg x , v 2 = g, and v$ = sin(g y ). After this re-writing the method will 
investigate the system ©,(0) and avoid quotients qtj that involve any v\. 

Methods to increase speed 

The restriction of not multiplying with factors involving dependent variables enables the 
following major speed up. Instead of investigating the system ® ,© and computing 6 x4 = 24 
quotients we investigate the system ©,© and compute only quotients between the terms of 
the coefficients of the same Vi in E\ and E 2 . This reduces the number of quotients to 1 x 1 
(for Vq) +1x1 (for v\) +3 x 2 (for v 2 ) +3 x (for t> 3 ) = 8 quotients. For large expressions, or 
more exactly for a high number of different v j the speed up is naturally much higher. If we 
have r + 1 new dependent variables v o, . . . , v r and if we denote the number of terms involving 
Vj in Ei as n^-, < j < r then instead of computing (X^=o n ij) x (Sfc=o n 2fe) quotients the 
more efficient method only computes Z)L=o ( n ij x n 2j) quotients. 

Another way of increasing efficiency is based on knowing the beforehand. For the 
terms involving a specific Vj, an upper bound on the maximal number of cancellations is 
mm(nij,n 2 j), saving twice as many terms. This value summed over j = . . . r (for each Vj) 
gives an upper bound on how many terms can be saved due to cancellations. 

This test can be performed without computing any quotients: 
If the new dependent variables are v , . . . , v r and if we have at the beginning 

r I r \ 

2 min(nij,n 2j ) <n 2 \ =J2 n ^ 

3=0 V 3=° J 

(n 2 + 1 is the minimum number of terms to be saved to reach a length reduction) then no 
length reduction is possible. 

This test of a necessary criterion is not only possible at the beginning but also during the 
computation of quotients. We assume that at first all quotients related to v are computed, 
then those related to V\, and so on. When the calculation has reached Vj and the first w terms 
in Ei that involve v j have been processed, i.e. all quotients between each of them and all terms 
in E 2 with Vj have already been computed then the following holds. At most min(nij —w, n 2 j) 
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more cancellations related to Vj are possible and an upper bound Bj(w) of the total number 
of any cancellations still to be found is given as 

r 

Bj(w) := min(nij - w,n 2 j) + ^ min ( n H> n 2i)- 

i=j+l 

At this moment any quotient q k i that has a chance to provide a length reduction must satisfy 

M k + m H + 2B j (w) > n 2 . 

All q k i which do not satisfy this condition can be dropped from the list L. For the same 
reason no new quotients should be added to L as soon as 2Bj(w) < n 2 . If the list L becomes 
empty at any time during the computation of quotients, then the search can stop. In that 
case no length reduction is possible. By dropping quotients from the list L, the updating of 
rriij and Mj speeds up. 

A speed up not recommended 

When deleting quotients from L that have no chance to give a length reduction then it is 
little extra effort to check whether any q k i of the remaining quotients in L already satisfies 
Mk + rriki > ri2, and therefore is guaranteed to provide a length reduction. As soon as such a 
quotient q k i is found the execution could stop. In practical tests it appeared that the negative 
effects of an early stop dominate. The first length reducing quotient q k i that is found does not 
have to be the one giving the highest length reduction possible. After using a sub-optimal q k i 
to compute E 3 and to substitute E\ no further length reductions may be possible, or even if 
further length reductions were possible, equations tend to have at least intermediately more 
terms compared with determining always the optimal quotient that gives the highest length 
reduction. After a suboptimal length reduction, subsequent pairings with other equations 
would be slower which would result in an overall slow down. It therefore is recommended to 
complete the computation of all relevant quotients and not to stop early when the first length 
reducing q k i is found. 

Some time tests 

The following tests can only provide some idea of the running times of the length reduction 
method. They are measured in a 8 MB Reduce 3.6 session running under Linux on a 
133 MHz Pentium PC (Dec. 1998). Equations which have been paired had been generated 
with the REDUCE command RANDPOLY which allows one to specify the number of terms to be 
generated. In RANDPOLY the randomly generated coefficients may become zero which in that 
case results in a polynomial with fewer terms than specified. Therefore the number of terms 
was chosen somewhat larger to be able to drop the surplus terms and get the required size of 
the polynomial and perform the following statistics. 

Performing a length reduction investigation involves no other risk or cost than the com- 
puter time that may be lost if no length reduction was possible. Therefore in figure 1 and table 
1 a statistics of investigations of each time two equations is shown where no length reduction 
could be found, i.e. the worst possible result. This will be referred to as unsuccessful pairings 
in contrast to successful pairings where a length reduction was possible. Both equations are 
random polynomials of degree up to 5 and with up to 8 variables. Times are obtained by 
averaging 20 runs. The individual times in these runs differ typically by up to 30%. The 
results confirm an overall dependence time oc (terms of eqn. l)x (terms of eqn. 2). 
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Figure 1. Running times for random polynomials of different sizes. 



no of terms of 1 st eqn. 
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time in sec (no success) 
if 2 n< ^ eqn. has f terms 
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1.36 


1.8 
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2.7 


time in sec (no success) 
if 2 n< ^ eqn. has 1000 terms 


4.18 


6.23 


9.35 


12.6 


15.4 


20.3 


24.4 


30.2 


36.4 


43.8 



Table 1. Timings of unsuccessful length reduction attempts of one equation with varying 
length and a second equation that has either 10 or 1000 terms. 



Test results shown in table 2 are based on pairing equations which are polynomials of 7 
degree with each 300 terms but with a varying number of variables. Times are averaged again 
over 20 runs. 

The effect of efficiency improvements as described in the above sub-section to detect the 
non-existence of length reductions early can be seen clearly from the second row in table 2. 
As more independent variables occur the number of different quotients qui increases and the 
average frequency for each quotient to appear becomes smaller. This in turn rules out many 
quotients early in the computation and it becomes clear earlier that no quotient will result in 
a length reduction if that is the case. 

Usually length reductions do not happen with random polynomials of that size. In order 
to measure computing times for pairings when length reductions were possible, pairs of poly- 
nomials had been constructed in the following way: a multiple of one random polynomial Pi 
of 300 terms is added to another random polynomial P2 and terms in excess of 300 terms are 
dropped to obtain a polynomial P3 with 300 terms. Length reductions between polynomials 
Pi and P 3 are investigated which produced the third row in table 2. Two trends seem to be 
present, one lowering the time with an increase of the number of variables (mainly effective 
between 4 and 5 variables) due to a decrease of potentially successful quotients and another 
trend slightly increasing the time with the number of variables. 



no of variables 


4 


5 


6 


7 


8 


time in msec (unsuccessful) 


4720 


2461 


1673 


1656 


1640 


time in msec (successful) 


5706 


4968 


5137 


5388 


5728 



Table 2: A comparison between average running times of unsuccessful and successful 
pairings of two equations. 
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To summarize this sub-section, the main feature found in this study is that computing 
times are lower in unsuccessful attempts to shorten equations and only when a length re- 
duction becomes possible, i.e. when computing times are of less importance, then they are 
increased. The speed up measures become increasingly efficient if more unknowns (like / and 
g in the first example) are present which is the case for partial differential equations with 
many different partial derivatives acting as different unknowns Vj during length reduction. 

The order of pairings of equations 

If more than two equations are given then the question arises in which order they should be 
paired to search for length reductions. Given that we combine only two equations at a time and 
multiplying them only with a monomial, we can not expect results that are invariant against 
combining equations in a different order. The following criteria serve only ggestion 
but they proved to be useful in applications. According to them pairs of equations are picked 
with the following priorities: 

• There should be as few as possible dependent variables Vi in the shorter equation which 
do not occur in the longer equation. 

• The shorter of both equations should be as short as possible. 

• The longer of both equations should be as short as possible. 

The first two rules maximizes the chance to find a reduction of terms. The third rule reduces 
computation times. The second rule has a higher priority than the third rule because the 
shorter the equations are, the more useful they are potentially in reducing the length of other 
equations. 

In the following table the above priority list is compared with the same list, only modified 
by exchanging in the first rule 'as few as possible' with 'as many as possible'. The equations 
are a set of first order partial differential equations (PDEs) resulting from investigating in 
General Relativity the Kimura metric j9] with respect to Killing tensors (see section 
Because sin and cos occur in these differential equations, both length reductions are performed 
once with the simplification rule cos(x) 2 =>■ 1 — sin(x) 2 and once with the simplification rule 
sin(x) 2 =>■ 1 — cos(x) 2 . It becomes apparent that these simplifications are not equivalent in 
their effect. 



simplification 
used 


original system 


length red. system 


length red. system 




using the rule 


using the rule 




'... as many as ...' 


'... as few as ...' 


no of eqn 


terms 


no of eqn 


terms 


no of eqn 


terms 


sin(x) 2 =^ 1 — cos(x) 2 


48 


607 


25 


117 


21 


81 


cos(x) 2 1 — sin(x) 2 


48 


464 


25 


108 


21 


74 



Table 3: A comparison of different simplification rules and different rules for the pairing 
of equations. 

As it was to be expected, the '... as few as ...' rule performed better (i.e. resulted in shorter 
length reduced systems) than the '... as many as ...' rule. What also becomes apparent is 
that choosing accidentally the less effective simplification rule sin(x) 2 1 — cos(x) 2 is less 
critical when length reduction is performed than without length reduction: 

no of terms of original system using cos-rule 607 

no of terms of original system using sin-rule 464 

no of terms of reduced system using cos-rule 81 

no of terms of reduced system using sin-rule 74 
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Beneficial side effects 

The reduction of length is not only useful for saving memory, and as shorter expressions are 
quicker to process later on, also for saving time. In this section we want to explain further 
benefits. 

1. Given a set of PDEs, the length reduced system is more likely to contain ordinary 
differential equations (ODEs) or to contain integrable exact PDEs. 

2. Length reduced polynomially non-linear equations are much more likely to be alge- 
braically factorizable. 

3. Length reduction of a system of equations has in general the side effect of partially 
decoupling the system. 

If the number of terms of an equation is lowered to, say, n and if the equation is linear then n 
is necessarily an upper bound for the number of different functions and different derivatives 
that occur. The length reduction method as described above is indiscriminate to different 
functions or different derivatives. On average therefore a consequence of a reduction of the 
number of terms will be a reduction of the number of different functions and the number of 
different derivatives that occur (see the comparison between tables 6 and 7 below) which in 
turn provides the above side effects. 

AboutlR 

If the number of terms in a length reduced equation got very small (say less than 4) then the 
chance increases that they involve only one differentiation variable, or that all derivatives can 
be looked at as derivatives with respect to only one variable of a common partial derivative, 
like d y f, d xy f, d xxy f are all ^-derivatives of d y f. In these cases the equation has been reduced 
to an ODE. (A proper ODE can of course only appear if the differential ideal of the original 
system does contain ODEs, but that is guaranteed for all the typical sources of over determined 
PDE-systems, like the computation of infinitesimal symmetries and of conservation laws.) 

Similarly the chance increases to obtain an exact differential equation. In order for a 
differential expression P(f l ) that involves functions f l and that satisfies = P to be a total 
x derivative of some expression I(f l ), the identity P = dl/dx has to be satisfied identically 
in all functions /' and in all their derivatives. The fewer different functions f % and the fewer 
different derivatives with respect to variables other than x occur, the less restrictive is this 
assumption of exactness and the more likely it will be satisfied. 

The conclusion is not that length reduction is the best way to proceed in order to integrate. 
We only say that the chance increases to find an integrable PDE in a length reduced system. 
In the appendix this statement is illustrated by a sequence of 10 integrations which a system 
of Killing tensor equations admits after being reduced in length. 

To explain the usefulness of integrability let us consider an extreme hypothetical example. 
Grobner Basis techniques aim at equations with a low differential order. This is a good 
strategy but not the only way to go. For example, knowing that = d 10 f/dx 10 is included 
in the differential ideal would be very useful as well. After integration, substitution of / and 
direct separation with respect to x (sometimes called splitting or fragmentation) a highly over 
determined system for the 10 functions of integration results. Although an expensive Grobner 
basis computation might have provided an ODE of lower order than 10, this information is 
usually gained faster by integrating = d 10 f/dx 10 and solving the over determined system 
that resulted from direct separation for the 10 functions of integration. The key to this speed 
up would be to sacrifice the minimal differential order for a reduction in the number of terms. 
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About\B 

In a small experiment we want to show that the chance for an algebraic factorization increases 
with a reduction of the number of terms. Bi-linear polynomials with up to 5 variables and 
coefficients in the interval —9, ...,—1,1.. .9 have been randomly generated for each length 
from 2 terms up to 6 terms. The percentage of factorizable polynomials is given in table 4. 
The chance to have a non-trivial factorization (non-numeric factors) decreases surprisingly 
quick with an increasing number of terms. Although this test is not proving anything it may 
serve as an illustration. 



no of terms 


2 


3 


4 


5 


6 


factorizable equations in % 


55.4 


13.6 


2.3 


0.22 


1.3 x 10~ 3 



Table 4: The chances for random quadratic homogeneous polynomials in up to 5 variables 
to be factorizable in dependence on the number of terms. 

About\^- 

The fewer different functions occur in each equation (on average) the more the system is (at 
least partially) decoupled and the fewer steps are needed in a subsequent computation to get a 
differential Grobner Basis or characteristic system. This means that an elimination algorithm 
performed afterwards has less work to do, i.e. needs fewer steps. A sparsely occupied system 
also opens the possibility to choose an appropriate total ordering on which an elimination 
algorithm is based. For example, if a function turns up in only few equations and with very 
few different derivatives then this function should get a high priority, i.e. the lexicographical 
ordering of functions which plays a role in any total ordering should give this function a high 
priority to be eliminated first because this will probably take the fewest steps to eliminate 
this function. 

3 Applications 

One of the applications to which the above length reduction procedure has been applied 
successfully is the computation of Killing vectors Ki(x p ) and rank 2 Killing tensors Kij(x p ) of 
given space times with a metric in General Relativity. Killing vectors and Killing tensors 
provide conservation laws of the form K(a l = const, and Kiju 1 ^ = const, where u % is the 4- 
velocity of geodesic motion in a curved space-time. If enough Killing vectors (i.e. symmetries 
of the space time) and Killing tensors are found then the equations of free (geodesic) motion 
of test particles in a curved space can be integrated which is a major step towards the physical 
interpretation of a space time metric. Killing vectors and their Lie algebras are also widely 
used to classify space time metrics. 

The determining equations for Killing vectors are 

A',, • <>• i,J = 1...4 (8) 

and for Killing tensors they are 

K ijil + K jl . i + K u . J = 0, i, j,l = l.. .4 (9) 

where ';' is the covariant derivative. Conditions ©,© are generated for a given space-time 
with metric tensor g^ by the program CLASSYM described in [THl- Examples are shown in 
table 5 (not selected from a larger set, but as they appeared in applications). The systems 
of equations determine either Killing vectors (KV) or Killing tensors (KT) for different space 
time metrics. One system is extended by integrability conditions (IC) and one system by 
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contractions (CT) with the metric tensor. The number of equations and terms before and 
after length reduction are shown. Running times in a REDUCE 3.6 session running under 
Linux on a 133 MHz Pentium PC are given in the right column. 



equations 


original 


shortened 


no of 
red. steps 


time 
in sec 


eqn 


terms 


eqn 


terms 


KV for Taub-NUT (pl70 in 5J) + IC 


49 


1929 


44 


932 


192 


120 


KT for the Kerr metric (pl61 in 


20 


1154 


20 


750 


36 


87 


KT for the Kimura metric |9J + CT 


48 


464 


21 


74 


93 


19.5 


KV for the Barnes metric [2] 


10 


66 


10 


39 


14 


2.3 


KV for pp waves (pi 78 in [3]) 


10 


58 


10 


37 


13 


1.7 



Table 5: The number of length reduction steps and the computation time for the length 
reduction of different Killing vector (KV) and Killing tensor (KT) determining systems of 
equations. 

The benefits of a length reduction for a program like Crack [13] are manifold. Tables 6 
and 7 show the number of terms and the occurring dependent variables in each of the Killing 
tensor equations for the Kimura metric before and after the reduction of length. 



no of 


A pr-\ pri A prif 




no of 


A pr-\ p ri A pri f 


tPTin 

LCI Alio 


vq tip nlp^ 

V Cl-L 1CX U1CO 




fprirm 

LCI Alio 


vp v\p\ nlp^ 

V Cl-L Id kJ 1*3 O 


9 


km knn 




Q 


k99 k1 1 kflfl kl^ kn^ k^ k?^ 


9 


k1 1 




Q 


k?9 ki 1 km knn kn? kns k^^ 


2 


k22, kl2 




9 


k22, kll, kOl, kOO, k02, k03, k33 


3 


kl3, k33, k23 




9 


k22, k33, k23 


3 


kl2, kOO, k02 




9 


kll, kOl, kOO 


3 


kOO, kl3, k03 




9 


kll, kOl, kOO 


3 


kll, kOl 




10 


k22, kl2, kll, kOO, k02, k33, k23 


3 


k22, kOl, k02 




11 


k22, kl2, kl3, k33, k23 


3 


kl2, kll 




11 


k22, kl2, kl3, k33, k23 


3 


kll, kl3 




12 


k22, kl2, kll, kOl, kOO, kl3, k33 


4 


kOl, k02, k03, k33 




14 


k22, kl2, kll, kOl, kl3, k02, k03, k33 


4 


kll, kOl, kOO 




14 


k22, kl2, kll, kOl, kl3, k02, k03, k33 


4 


kl2, kOl, k02 




18 


kl2, kll, kOl, kOO, kl3, k02, k03 


4 


kOl, kl3, k03 




18 


kl2, kll, kOl, kOO, kl3, k02, k03 


4 


k02, k03, k23 




18 


kl2, kll, kOl, kOO, kl3, k02, k03 


4 


k22, kl2, kll 




18 


kl2, kll, kOl, kOO, kl3, k02, k03 


4 


k22, kl3, k23 




19 


k22, kl2, kll, kOO, kl3, k02, k03, k33, k23 


5 


k22, kl2, k33, k23 




19 


k22, kl2, kll, kOO, kl3, k02, k03, k33, k23 


5 


kl2, kll, kl3, k33 




21 


k22, kl2, kll, kOl, kOO, kl3, k02, k03, k33 


5 


kl2, kl3, k23 




21 


k22, kl2, kll, kOl, kOO, kl3, k02, k03, k33 


8 


kll, kOl, kOO 




21 


k22, kl2, kl3, k02, k03, k33, k23 


8 


kl2, kll, kOO, kl3, k02, 


k03 


21 


k22, kl2, kl3, k02, k03, k33, k23 


8 


kl2, kll, kOO, kl3, k02, 


k03 


21 


k22, kl2, kl3, k02, k03, k33, k23 


8 


kll, kOl, kOO 




21 


k22, kl2, kl3, k02, k03, k33, k23 



Table 6: The original set of conditions for Killing tensors in the Kimura metric. 
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no of 


dependent 


no of 


dependent 


terms 


variables 


terms 


variables 


2 


kOl, kOO 


4 


k22, kl3, k33, k23 


2 


kll 


4 


kll, kOl, kOO 


2 


k22, kl2 


4 


kOl, k02, k03, k33 


3 


kl3, k33, k23 


4 


kl2, kOl, k02 


3 


kl2, kOO, k02 


4 


kOl, kl3, k03 


3 


kOO, kl3, k03 


4 


k02, k03, k23 


3 


kll, kOl 


4 


k22, kl2, kll 


3 


k22, kOl, k02 


5 


k22, k33, k23 


3 


kl2, kll 


5 


kl2, kll, kl3, k33 


3 


kll, kl3 


5 


kl2, kl3, k23 


4 


k22, k33, k23 







Table 7: The length reduced conditions for Killing tensors in the Kimura metric. 

The equations of table 7 (after dropping a single linear dependent equation with 4 terms) 
are explicitly given in the appendix. It is indicated there how the sparse occurrence of different 
derivatives leads to integrable ODEs and exact differential equations. 

Length reduction as an adjunct to elimination algorithms 

The main purpose of the term reduction method is to shorten and simplify systems of equa- 
tions without imposing any risk of an intermediate length increase. This risk is present when 
using standard Gaussian elimination or standard Grobner basis methods. For that reason 
and the benefits discussed above it would not matter if the term reduction method would be 
comparatively time consuming. The following run time tests show that term reduction can 
even be time saving if it is used in connections with computing differential Grobner bases or 
the radical of a differential ideal. 

The program rif [H],^2] has been applied by Allan Wittkopf to the Killing tensor con- 
ditions for the Kimura metric (see table 3, the shortened form is given in the appendix). 
Because we only provide the relative speed up in tables 8,9 the following hard and software 
specifications are not of much relevance. They are added only for completeness. Computing 
times of the program rif are measured on a PII 333 MHz Linux system with 384 MB RAM 
(although only about 3 MB was needed), running in XMaple release 5.0. 

Differential Grobner Bases have been computed, once in the generic case where the con- 
stant b is treated like a dependent variable and a second time where b is regarded as an 
arbitrary constant, ignoring b pivots. 

In the tables 8 and 9 kimul is the form of the system with 607 terms (see table 3), kimu2 
is a pre-optimal form of the system with 128 terms which resulted when a non-optimal pairing 
of equations was used and finally kimu3 is the system with 74 terms (see appendix). 



equations 


rif, with b pivots 


rif, without b pivots 


^kimu2 A kimul 
^kimu3 A kimul 


96.8 % 
86.6 % 


93.2 % 

72.3 % 



Table 8: Relative speed up of computing times (cpu time) for the program rif for different 
versions of the Killing tensor conditions of the Kimura metric. 

The Maple program diffalg has been applied by Evelyne Hubert (see [SI EJ Ej and 
the URL http : //daisy .uwaterloo . ca/~ehubert/Dif f alg). diffalg was run on a PC with 
dual Intel 400Mhz CPU's, 512 MB RAM, 128 MB swap, Asus P2L7-DS motherboard, 4.3 GB 
Seagate hard drive and Intel Ether Express Pro 10/100 network card under LINUX. 
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equations 


diffalg 


^kimu2 Akimul 
^kimu3 Akimul 


83.7 % 
40.1 % 



Table 9: Relative speed up of computing times (cpu time) for the program diffalg for 
different versions of the Killing tensor conditions of the Kimura metric. 

A potential gain of a reduction of the number of terms is the possibility to choose a more 
appropriate total ordering in a Grobner Basis computations when a system is already partially 
decoupled. For example, dependent variables which turn up in fewer equations could be given 
a higher lexicographical priority than other dependent variables. So far no advantage has 
been taken of this opportunity in the package Crack. 

4 Possible extensions 

The manipulations studied in this paper to achieve a reduction of length seem very special 
and one might seek a more comprehensive theory, for example, covering all possible linear 
combinations of any number of equations multiplied not only with monomials but with any 
number of terms. This problem is harder than it looks. It would include being able to decide 
whether any given linear algebraic system could be solved without intermediate memory 
increase. Already the question of combining 3 equations to kill at least 5 terms (4 terms could 
be killed by combining 2 equations twice) has a much larger search space than combining 2 
equations. Although not proven, the author expects that in generic practical applications 
the frequency of reducing the length of any one of 3 equations which can not be reached 
by combining repeatedly 2 equations is low (which, of course, may be different in specific 
applications). To give an example, the system = a + b — c — d + g, = c + d — e — f + h, = 
e + f — a — b + k gives a shortened equation only if all three equations are combined (added). 

It seems to be a general phenomenon that the computational complexity increases drasti- 
cally when any method of investigating systems of equations is generalized whereas the success 
rate of the generalized method increases only marginally. 

To give an example, in determining Lie-symmetries and conservation laws of generic par- 
tial differential equations (not the rare fully integrable PDEs) the success rate in finding 
symmetries when going from point symmetries to higher order symmetries or the success rate 
in finding conservation laws when going from zeroth order integrating factors to higher order 
conservation laws increases only marginally. On the other hand the complexity in solving the 
related over determined system of conditions grows drastically when the order of the ansatz is 
increased. The growth in complexity comes from the task to determine an increased number 
of constants or functions or in determining functions of more independent variables. The de- 
creasing success rate results from a relatively more over determined problem: having to satisfy 
not only the more restrictive conditions of a more general ansatz but also the conditions that 
the solution may not be decomposable into simpler cases. 

5 Summary 

Although conceptually simple, the method explained in this paper proved to be very useful. 
It is fast and it has zero risk of increasing the size of the system of equations during the 
computation or as a result of it. This is an important feature for very memory intensive 
computations. In such a case, computation times are of less concern. Still, it was possible to 
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show that even for other well known programs the time saving using term reduction could be 
higher than the additional cost. 

As discussed in the subsection 'Other benefits', size reduced equations on average are more 
likely to be an ODE or to be a total derivative and therefore easily integrable. They are also 
more likely to be algebraically factorizable. The fact that the system becomes partially decou- 
pled opens the possibility to select a total ordering according to which the system is already 
close to a characteristic form. This has the potential to reduce subsequent computations to 
obtain a characteristic system to a high extend. 
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7 Appendix: Length reduced conditions for Killing ten- 
sors of the Kimura metric 

The size reduced determining conditions for Killing tensors in the Kimura metric read: 



= k 22 , h + 2b 2 r 3 k 12 

= rkii^ r + 2/cn 

= fcoo,t - 2br 3 k 01 

= 2 cos(/i) sm(h)k 23 + k 33>p + 2 sin(/i) 2 6 2 r 3 /ci 3 

= rku !P + 2rk± 3 . r - 2k± 3 

= rk Ujh + 2r/c 12 , r - 2k l2 

= 2k 02 , h + k 22 , t + 2b 2 r 3 k i 

= 2rk i, r + rk llt t - 2k i 

= k 00>p + 2k 03tt -2br 3 k 13 

= k 00th + 2k 02 , t -2br 3 k l2 

= 6 cos(/i) sm(h)k 23 - sin(h) 2 k 22)P - 2 sin(h) 2 k 23yh + k 33yP 

= 2 cos(/i) sm(h)k 02 + 2k 03:P + k 33>t + 2 sin(h) 2 b 2 r 3 k 01 

= 2rk 12th + rk 22 , r + 2b 2 r A kn - Ak 22 

= 2 cos(h)k 03 - sm(h)k 2 !P - sm(h)k 03>h - sin(/i)/c 23it 

= rfcoij, + rk 03:T + rk 13jt - 4/c 03 

= rk 01ih + r/c 02 ,r + rk 12 , t - 4k 02 

= rk 00:r + 2rk 01>t + br 5 k Utr - 4k 00 

= 2 cos(/i) sm(h) 2 k 22 — 4 cos(h)k 33 — sin(h) 3 k 22> h + 2 sm(h)k 23 , p 



(10) 

(11) 
(12) 

(13) 

(14) 

(15) 

(16) 

(17) 

(18) 

(19) 

(20) 

(21) 

(22) 

(23) 

(24) 

(25) 

(26) 



+ sin(h)k 33 , h 

= 2 cos(/i) sm(h)rki2 + 2rk 13jP + rk 33 , r + 2 sm(h) 2 b 2 r 4 k u - Ak 33 

= 2 cos(h)rki 3 — sin(/i)r/ci2 iP — sin(h)rki 3t h — sm{h)rk 23;r + 4 sin(h)k 23 



(27) 
(28) 
(29) 
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One effect of reducing the number of terms is that fewer different derivatives of different 
dependent variables k^ occur in each equation. As a consequence the chance for the equations 
to be exact or to be simple ODEs increases. The package Crack is able to integrate 10 of the 
above equations and as a consequence to express 10 of the unknown functions kij in terms of 
new unknown functions of integration of only 3 variables: integration of (|11|) to solve for ku, 
of ((Hj) to solve for ki 3 , of (fTB]) to solve for ki 2 , of (fTUj) to solve for k 22 , of (fT7|) to solve for koi, 
of (fT2j) to solve for k 0Q , of |T8|) to solve for k 03 , of fT6j) to solve for k 02 , of (|28|) to solve for A; 33 
and of (|29|) to solve for k 23 . These integrations and substitutions are a first step in a longer 
calculation which finally gives the following general solution. The 13 free constants c\ . . . c\ 3 
stand for 13 Killing tensors, i.e. 13 conserved first integrals of geodesic motion in the curved 
space described by the Kimura metric. 

k 00 = (3b 2 rH 2 Cl + A8b 2 rHc 2 + 3br 2 a - br A c 3 - 3brH 2 c 4 - r 2 c 4 ) /(6b) 
koi = (brtci + 8brc 2 — rtc 4 ) / (26) 
k Q2 = cos(jo)r 4 Ci2 + sin(jo)r 4 Cn 

k 03 = (— cos(/i) 3 cos(j»)r 4 Cn + cos(/i) 3 sin(p)r 4 Ci2 — cos(h) 2 sm(h)r 4 c\ 3 

+ cos(/i) cos(p)r 4 Cn — cos(h) sin(p)r 4 Ci 2 + sin(/i)r 4 Ci 3 ) / sin(/i) 
k u = -c 4 /(36 2 r 2 ) 
fcu = 
k 13 = 

^22 = (6 cos(j») 2 r 4 C6 + 6 cos(p) sin(p)r 4 c 5 — 36 2 r 4 cit 2 — A8b 2 rHc 2 + 36r 4 t 2 c 4 
-6r 4 c 7 - 2r 2 c 4 ) /6 

k 23 = (2 cos(/i) 2 cos(p)r 4 c 8 — 2 cos(/i) 2 sin(p)r 4 c 9 + 2 cos(h) cos(p) 2 sin(/i)r 4 c 5 
—2 cos(/i) cos(p) sm(h) sin(p)r 4 C6 — cos(ti) sin(/t)r 4 cs — 2 cos(p)r 4 cs 
+2 sin(p)r 4 c 9 ) /2 

k 33 = (l2 cos(/i) 5 cos(p)r 4 cg + 12 cos(/i) 5 sin(p)r 4 cs + 6 cos(/i) 4 cos(p) 2 sin(/i)r 4 C6 

+6 cos(/i) 4 cos(p) sin(/i) sin(p)r 4 c 5 — 6 cos(/i) 4 sin(/i)r 4 C7 + 6 cos(^) 4 sin(/i)r 4 ci 

—24 cos(ft-) 3 cos(p)r eg — 24 cos(/i) 3 sin(p)r 4 cs — 6 cos(/i) 2 cos(p) 2 sin(/i)r 4 C6 

—6 cos(/i) 2 cos(p) sin(/i) sin(p)r 4 C5 + 3 cos(/i) 2 sin(/i)6 2 r 4 t 2 ci — 12 sin(h)r 4 cj 

+48 cos(/i) 2 sin(/i)6 2 r 4 tc2 — 3 cos(/i) 2 sin(/i)6r 4 t 2 c 4 — 6 cos(h) 2 sm(h)r cq 

+ 18 cos(/i) 2 sin(/i)r 4 C7 — 12 cos(/i) 2 sin(/i)r 4 cio + 2 cos(/i) 2 sin(/i)r 2 c 4 

+ 12 cos(h) cos(p)r 4 c 9 + 12 cos(h) sin(p)r 4 c 8 — 3 sin(/i)6 2 r 4 t 2 ci + 6 sm(h)r 4 c w 

—48 sin(h)b 2 rHc 2 + 3 sin(/i)6r 4 t 2 c 4 + 6 sin(/i)r 4 c 6 — 2 sin(/i)r 2 c 4 ) / (6 sin(/i)) 

What makes this result and the Kimura metric interesting is that two of these Killing tensors 
are non-trivial, i.e. they are not just symmetrized products of the 4 Killing vectors of this 
metric (see, for example jU]). 
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